Quantifying ChIP-seq data: a spiking method providing an internal reference for sample-to-sample normalization.
Authors
Abstract
Chromatin immunoprecipitation followed by deep sequencing (ChIP-seq) experiments are widely used to determine, within entire genomes, the occupancy sites of any protein of interest, including, for example, transcription factors, RNA polymerases, or histones with or without various modifications. In addition to allowing the determination of occupancy sites within one cell type and under one condition, this method allows, in principle, the establishment and comparison of occupancy maps in various cell types, tissues, and conditions. Such comparisons require, however, that samples be normalized. Widely used normalization methods that include a quantile normalization step perform well when factor occupancy varies at a subset of sites, but may miss uniform genome-wide increases or decreases in site occupancy. We describe a spike adjustment procedure (SAP) that, unlike commonly used normalization methods intervening at the analysis stage, entails an experimental step prior to immunoprecipitation. A constant, low amount from a single batch of chromatin of a foreign genome is added to the experimental chromatin. This "spike" chromatin then serves as an internal control to which the experimental signals can be adjusted. We show that the method improves similarity between replicates and reveals biological differences including global and largely uniform changes.
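The idea of adjusting experimental signals to a constant spike can be illustrated with a minimal sketch. This is not the SAP implementation from the paper, whose details the abstract does not give. It assumes the simplest possible adjustment: each sample is rescaled so that its spike-genome read count matches a common reference. The function names `spike_scale_factors` and `adjust_signals`, the choice of the mean as reference, and all numbers are hypothetical and for illustration only.

```python
def spike_scale_factors(spike_counts):
    """Return per-sample factors that equalize spike-in read counts.

    spike_counts: dict mapping sample name -> reads aligned to the spike genome.
    The reference is the mean spike count across samples (an arbitrary,
    assumed choice; any fixed reference gives the same between-sample ratios).
    """
    if not spike_counts:
        raise ValueError("no samples given")
    if any(c <= 0 for c in spike_counts.values()):
        raise ValueError("every sample needs a positive spike-in read count")
    reference = sum(spike_counts.values()) / len(spike_counts)
    return {name: reference / c for name, c in spike_counts.items()}


def adjust_signals(signals, factors):
    """Multiply each sample's per-site signal by that sample's spike factor."""
    return {name: [v * factors[name] for v in values]
            for name, values in signals.items()}


# Toy numbers, invented for illustration only (not data from the paper).
# Both samples show identical depth-normalized signals at three sites,
# but the treated sample yielded half as many spike-genome reads.
spike_counts = {"control": 1000, "treated": 500}
signals = {"control": [10.0, 20.0, 40.0], "treated": [10.0, 20.0, 40.0]}

factors = spike_scale_factors(spike_counts)
adjusted = adjust_signals(signals, factors)
```

In this toy case the unadjusted signals are identical between samples, so a purely analysis-stage comparison would report no change. After adjustment to the spike, the treated sample is twofold higher at every site, which is the kind of uniform genome-wide difference the abstract says quantile-based normalization may miss.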
Similar resources
Efficiently identifying genome-wide changes with next-generation sequencing data
We propose a new and effective statistical framework for identifying genome-wide differential changes in epigenetic marks with ChIP-seq data or gene expression with mRNA-seq data, and we develop a new software tool EpiCenter that can efficiently perform data analysis. The key features of our framework are: (i) providing multiple normalization methods to achieve appropriate normalization under d...
A Unified Model for Differential Expression Analysis of RNA-seq Data via L1-Penalized Linear Regression
The RNA-sequencing (RNA-seq) is becoming increasingly popular for quantifying gene expression levels. Since the RNA-seq measurements are relative in nature, between-sample normalization of counts is an essential step in differential expression (DE) analysis. The normalization of existing DE detection algorithms is ad hoc and performed once for all prior to DE detection, which may be suboptimal ...
Analysis of ChIP-seq Data with ‘mosaics’ Package
This vignette provides an introduction to the analysis of ChIP-seq data with ‘mosaics’ package. R package mosaics implements MOSAiCS, a statistical framework for the analysis of ChIP-seq data, proposed in [1]. MOSAiCS stands for“MOdel-based one and two Sample Analysis and Inference for ChIP-Seq Data”. Based on careful investigation of biases in ChIP-seq data such as mappability and GC content, ...
A highly efficient and effective motif discovery method for ChIP-seq/ChIP-chip data using positional information
Identification of DNA motifs from ChIP-seq/ChIP-chip [chromatin immunoprecipitation (ChIP)] data is a powerful method for understanding the transcriptional regulatory network. However, most established methods are designed for small sample sizes and are inefficient for ChIP data. Here we propose a new k-mer occurrence model to reflect the fact that functional DNA k-mers often cluster around ChI...
Supplement Materials for Normalization of ChIP-seq data with control Kun
We demonstrate that a proper control sample correlates linearly with the background parts of its corresponding ChIP sample. In the following examples, we first draw the original ChIP vs control bins counts to show the over-abundance of high ChIP count bins due to binding signals. Then we filter the strong binding signals by calling peaks with SPP (Kharchenko et al., 2008) at FDR 0.1 level and e...
Journal title:
- Genome Research
Volume 24, Issue 7
Pages -
Publication date: 2014